DNA sequence of T. reesei cbh1 signal sequence + catalytic domain + linker (1570 bases)



ATGTATCGGAAGTTGGCCGTCATCTCGGCCTTCTTGGCCACAGCTCGTGCTCA

GTCGGCCTGCACTCTCCAATCGGAGACTCACCCGCCTCTGACATGGCAG 

AAATGCTCGTCTGGTGGCACTTGCACTCAACAGACAGGCTCCGTGGTCA 

TCGACGCCAACTGGCGCTGGACTCACGCTACGAACAGCAGCACGAACTG 

CTACGATGGCAACACTTGGAGCTCGACCCTATGTCCTGACAACGAGACC 

TGCGCGAAGAACTGCTGTCTGGACGGTGCCGCCTACGCGTCCACGTACG 

GAGTTACCACGAGCGGTAACAGCCTCTCCATTGGCTTTGTCACCCAGTC 

TGCGCAGAAGAACGTTGGCGCTCGCCTTTACCTTATGGCGAGCGACACG 

ACCTACCAGGAATTCACCCTGCTTGGCAACGAGTTCTCTTTCGATGTTGA 

TGTTTCGCAGCTGCCGTAAGTGACTTACCATGAACCCCTGACGTATCTTC 

TTGTGGGCTCCCAGCTGACTGGCCAATTTAAGGTGCGGCTTGAACGGAG 

CTCTCTACTTCGTGTCCATGGACGCGGATGGTGGCGTGAGCAAGTATCC 

CACCAACACCGCTGGCGCCAAGTACGGCACGGGGTACTGTGACAGCCAG 

TGTCCCCGCGATCTGAAGTTCATCAATGGCCAGGCCAACGTTGAGGGCT 

GGGAGCCGTCATCCAACAACGCAAACACGGGCATTGGAGGACACGGAA 

GCTGCTGCTCTGAGATGGATATCTGGGAGGCCAACTCCATCTCCGAGGC 

TCTTACCCCCCACCCTTGCACGACTGTCGGCCAGGAGATCTGCGAGGGT 

GATGGGTGCGGCGGAACTTACTCCGATAACAGATATGGCGGCACTTGCG

ATCCCGATGGCTGCGACTGGAACCCATACCGCCTGGGCAACACCAGCTT 

CTACGGCCCTGGCTCAAGCTTTACCCTGGATACCACCAAGAAATTGACC 

GTTGTCACCCAGTTCGAGACGTCGGGTGCCATCAACCGATACTATGTCC 

AGAATGGCGTCACTTTCCAGCAGCCCAACGCCGAGCTTGGTAGTTACTC 

TGGCAACGAGCTCAACGATGATTACTGCACAGCTGAGGAGGCAGAATTC 

GGCGGATCCTCTTTCTCAGACAAGGGCGGCCTGACTCAGTTCAAGAAGG 

CTACCTCTGGCGGCATGGTTCTGGTCATGAGTCTGTGGGATGATGTGAG 

TTTGATGGACAAACATGCGCGTTGACAAAGAGTCAAGCAGCTGACTGAG 

ATGTTACAGTACTACGCCAACATGCTGTGGCTGGACTCCACCTACCCGA 

CAAACGAGACCTCCTCCACACCCGGTGCCGTGCGCGGAAGCTGCTCCAC 

CAGCTCCGGTGTCCCTGCTCAGGTCGAATCTCAGTCTCCCAACGCCAAG 

GTCACCTTCTCCAACATCAAGTTCGGACCCATTGGCAGCACCGGGAACC 

CTAGCGGCGGCAACCCTCCCGGCGGAAACCCGCCTGGCACCACCACCACCCG

CCGCCCAGCCACTACCACTGGAAGCTCTCCCGGACCTACTAGT 
Figure 3

Amino acid sequence of T.reesei cbhl signal sequence + catalytic domain + linker (480 
amino acids) 

MYRKLAVISAFLATARAQSACTLQSETHPPLTWQKCSSGGTCTQQTGSVVID

ANWRWTHATNSSTNCYDGNTWSSTLCPDNETCAKNCCLDGAAYASTYGVT 

TSGNSLSIGFVTQSAQKNVGARLYLMASDTTYQEFTLLGNEFSFDVDVSQLP 

CGLNGALYFVSMDADGGVSKYPTNTAGAKYGTGYCDSQCPRDLKFINGQA 

NVEGWEPSSNNANTGIGGHGSCCSEMDIWEANSISEALTPHPCTTVGQEICE 

GDGCGGTYSDNRYGGTCDPDGCDWNPYRLGNTSFYGPGSSFTLDTTKKLT 

VVTQFETSGAINRYYVQNGVTFQQPNAELGSYSGNELNDDYCTAEEAEFGG

SSFSDKGGLTQFKKATSGGMVLVMSLWDDYYANMLWLDSTYPTNETSSTP

GAVRGSCSTSSGVPAQVESQSPNAKVTFSNIKFGPIGSTGNPSGGNPPGGNPPG

TTTTRRPATTTGSSPGPTS
Figure 4

DNA sequence of Acidothermus cellulolyticus GH5A endoglucanase 1 catalytic domain 
(1077 bases) 

GCGGGCGGCGGCTATTGGCACACGAGCGGCCGGGAGATCCTGGACGCGAAC 

AACGTGCCGGTACGGATCGCCGGCATCAACTGGTTTGGGTTCGAAACCTGCA 

ATTACGTCGTGCACGGTCTCTGGTCACGCGACTACCGCAGCATGCTCGACCA 

GATAAAGTCGCTCGGCTACAACACAATCCGGCTGCCGTACTCTGACGACATT 

CTCAAGCCGGGCACCATGCCGAACAGCATCAATTTTTACCAGATGAATCAGG 

ACCTGCAGGGTCTGACGTCCTTGCAGGTCATGGACAAAATCGTCGCGTACGC 

CGGTCAGATCGGCCTGCGCATCATTCTTGACCGCCACCGACCGGATTGCAGC 

GGGCAGTCGGCGCTGTGGTACACGAGCAGCGTCTCGGAGGCTACGTGGATTT 

CCGACCTGCAAGCGCTGGCGCAGCGCTACAAGGGAAACCCGACGGTCGTCG 

GCTTTGACTTGCACAACGAGCCGCATGACCCGGCCTGCTGGGGCTGCGGCGA 

TCCGAGCATCGACTGGCGATTGGCCGCCGAGCGGGCCGGAAACGCCGTGCTC 

TCGGTGAATCCGAACCTGCTCATTTTCGTCGAAGGTGTGCAGAGCTACAACG 

GAGACTCCTACTGGTGGGGCGGCAACCTGCAAGGAGCCGGCCAGTACCCGGT 

CGTGCTGAACGTGCCGAACCGCCTGGTGTACTCGGCGCACGACTACGCGACG 

AGCGTCTACCCGCAGACGTGGTTCAGCGATCCGACCTTCCCCAACAACATGC 

CCGGCATCTGGAACAAGAACTGGGGATACCTCTTCAATCAGAACATTGCACC 

GGTATGGCTGGGCGAATTCGGTACGACACTGCAATCCACGACCGACCAGACG 

TGGCTGAAGACGCTCGTCCAGTACCTACGGCCGACCGCGCAATACGGTGCGG 

ACAGCTTCCAGTGGACCTTCTGGTCCTGGAACCCCGATTCCGGCGACACAGG 

AGGAATTCTCAAGGATGACTGGCAGACGGTCGACACAGTAAAAGACGGCTAT 

CTCGCGCCGATCAAGTCGTCGATTTTCGATCCTGTCGGC 
Figure 5

Amino acid sequence of Acidothermus cellulolyticus GH5A endoglucanase 1 catalytic 
domain (359 amino acids) 

AGGGYWHTSGREILDANNVPVRIAGINWFGFETCNYVVHGLWSRDYRSMLDQI

KSLGYNTIRLPYSDDILKPGTMPNSINFYQMNQDLQGLTSLQVMDKIVAYAGQIG 

LRIILDRHRPDCSGQSALWYTSSVSEATWISDLQALAQRYKGNPTVVGFDLHNEP 

HDPACWGCGDPSIDWRLAAERAGNAVLSVNPNLLIFVEGVQSYNGDSYWWGG

NLQGAGQYPVVLNVPNRLVYSAHDYATSVYPQTWFSDPTFPNNMPGIWNKNW

GYLFNQNIAPVWLGEFGTTLQSTTDQTWLKTLVQYLRPTAQYGADSFQWTFWS 

WNPDSGDTGGILKDDWQTVDTVKDGYLAPIKSSIFDPVG 
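
The amino acid listing in Figure 5 can be cross-checked against a translation of the DNA in Figure 4. The following is a minimal Python sketch of such a check. It assumes the standard genetic code and a reading frame that starts at the first listed base; neither is stated in the figures. The function name `translate` and the choice of excerpt are illustrative only.

```python
from itertools import product

# Standard genetic code laid out in TCAG order.
# Assumption: the figures do not say which translation table applies.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame 0; unknown codons become 'X', stops become '*'."""
    dna = "".join(dna.split()).upper()      # drop line breaks and spaces
    usable = len(dna) - len(dna) % 3        # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, usable, 3))


# First line of the Figure 4 listing (51 bases).
fig4_start = "GCGGGCGGCGGCTATTGGCACACGAGCGGCCGGGAGATCCTGGACGCGAAC"
print(translate(fig4_start))
```

For this excerpt the sketch prints `AGGGYWHTSGREILDAN`, which agrees with the opening residues of the Figure 5 listing. Codons containing characters other than A, C, G, or T are reported as `X`, which makes transcription damage in a listing easy to spot.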
FIGURE 6

DNA sequence of Acidothermus cellulolyticus GH48 cellulase catalytic domain (1914
bases) 

AACGACCCGTACATCCAGCGGTTCCTCACGATGTACAACAAGATTCACGACC 

CAGCGAACGGCTACTTCAGCCCGCAGGGAATTCCCTACCACTCGGTAGAAAC 

GCTCATCGTTGAGGCACCGGACTACGGGCACGAGACAACTTCGGAGGCGTAC 

AGCTTCTGGCTCTGGCTCGAAGCGACGTACGGCGCAGTGACCGGCAACTGGA 

CGCCGTTCAACAACGCCTGGACGACGATGGAAACGTACATGATCCCGCAGCA 

CGCGGACCAGCCGAACAACGCGTCGTACAACCCCAACAGCCCGGCGTCGTAC 

GCTCCGGAAGAGCCGCTGCCCAGCATGTACCCGGTTGCCATCGACAGCAGCG 

TGCCGGTTGGGCACGACCCGCTCGCCGCCGAATTGCAGTCGACGTACGGCAC 

TCCGGACATTTACGGCATGCACTGGCTGGCCGACGTTGACAACATCTACGGA 

TACGGCGACAGCCCCGGCGGTGGTTGCGAACTCGGTCCTTCCGCTAAGGGCG 

TCTCCTACATCAACACATTCCAGCGCGGCTCGCAGGAGTCCGTCTGGGAGAC 

GGTCACCCAGCCGACGTGCGACAACGGCAAGTACGGTGGGGCGCACGGCTA 

CGTCGACCTGTTCATCCAGGGTTCGACGCCGCCGCAGTGGAAGTACACCGAT 

GCCCCGGACGCCGACGCCCGTGCCGTCCAGGCTGCGTACTGGGCCTACACCT 

GGGCATCGGCGCAGGGCAAGGCAAGCGCGATTGCCCCGACGATCGCCAAGG 

CGGCCAAACTCGGCGACTACCTGCGGTACTCGCTCTTTGACAAGTACTTCAAG

CAGGTCGGCAACTGCTACCGGGCCAGCTCCTGCCCTGGAGCAACCGGACGCC 

AGAGCGAGACCTACCTGATCGGCTGGTACTACGCCTGGGGCGGCTCAAGCCA 

AGGCTGGGCCTGGCGCATTGGTGACGGCGCCGCGCACTTCGGCTACCAGAAT 

CCGCTTGCCGCGTGGGCGATGTCGAACGTGACACCGCTCATTCCGCTCTCGCC 

CACGGCAAAGAGCGACTGGGCGGCGAGCTTGCAGCGCCAGCTGGAGTTCTAC 

CAGTGGTTGCAATCCGCGGAAGGAGCCATTGCGGGCGGCGCCACCAACAGCT 

GGAACGGCAATTACGGGACCCCGCCGGCCGGAGACTCGACCTTCTACGGCAT 

GGCGTACGACTGGGAGCCGGTCTACCACGACCCGCCGAGCAACAACTGGTTC 

GGCTTCCAGGCGTGGTCCATGGAACGGGTTGCCGAGTACTACTACGTCACCG 

GCGACCCGAAGGCCAAGGCGCTGCTCGACAAGTGGGTCGCATGGGTGAAGC 

CGAATGTCACCACCGGTGCCTCATGGTCGATTCCGTCGAATTTGTCCTGGAGC 

GGCCAACCGGATACCTGGAATCCGAGCAACCCAGGAACGAATGCCAACCTG 

CACGTGACCATCACGTCGTCCGGGCAGGACGTCGGTGTTGCCGCGGCGCTCG 

CGAAGACACTCGAGTACTACGCGGCAAAATCCGGCGATACGGCCTCGCGCGA 

CCTCGCGAAGGGATTGCTCGACTCCATGTGGAACAACGACCAGGACAGCCTC 

GGTGTGAGCACACCGGAGACGCGGAGCGACTACTCTCGGTTCACTCAGGTGT 

ACGACCCGACGACTGGTGACGGCCTCTACATCCCGTCGGGTTGGACGGGGAC 

CATGCCCAACGGTGACCAAATCAAGCCGGGTGCGACCTTCCTGAGCATCCGG

TCCTGGTACACCAAGGATCCGCAGTGGTCGAAGGTGCAGGCGTACCTCAACG 

GCGGGCCTGCTCCGACGTTCAACTACCACCGGTTCTGGGCGGAGTCCGACTT 

CGCGATGGCGAACGCCGATTTTGGCATGCTCTTCCCATCCGGG 
FIGURE 7

Amino acid sequence of Acidothermus cellulolyticus GH48 catalytic domain (638 amino
acids) 

NDPYIQRFLTMYNKIHDPANGYFSPQGIPYHSVETLIVEAPDYGHETTSEAYSFW 

LWLEATYGAVTGNWTPFKNAWTTMETYMIPQHADQPNNASYNPNSPASYAPEE

PLPSMYPVAIDSSVPVGHDPLAAELQSTYGTPDIYGMHWLADVDNIYGYGDSPG 

GGCELGPSAKGVSYINTFQRGSQESVWETVTQPTCDNGKYGGAHGYVDLFIQGS 

TPPQWKYTDAPDADARAVQAAYWAYTWASAQGKASAIAPTIAKAAKLGDYLR 

YSLFDKYFKQVGNCYPASSCPGATGRQSETYLIGWYYAWGGSSQGWAWRIGD 

GAAHFGYQNPLAAWAMSNVTPLIPLSPTAKSDWAASLQRQLEFYQWLQSAEGA 

IAGGATNSWNGNYGTPPAGDSTFYGMAYDWEPVYHDPPSNNWFGFQAWSMER 

VAEYYYVTGDPKAKALLDKWVAWVKPNVTTGASWSIPSNLSWSGQPDTWNPS 

NPGTNANLFTVTITSSGQDVGVAAALAKTLEYYAAKSGDTASRDLAKGLLDSMW

NNDQDSLGVSTPETRTDYSRFTQVYDPTTGDGLYIPSGWTGTMPNGDQIKPGAT 

FLSIRSWYTKDPQWSKVQAYLNGGPAPTFNYHRFWAESDFAMANADFGMLFPS 

G 



FIGURE 8A

DNA sequence of Acidothermus cellulolyticus GH74 catalytic domain 

GCGACGACTCAGCCGTACACCTGGAGCAACGTGGCGATCGGGGGCGGCGGC 

TTTGTCGACGGGATCGTCTTCAATGAAGGTGCACCGGGAATTCTGTACGTGCG 

GACGGACATCGGGGGGATGTATCGATGGGATGCCGCCAACGGGCGGTGGAT 

CCCTCTTCTGGATTGGGTGGGATGGAACAATTGGGGGTACAACGGCGTCGTC 

AGCATTGCGGCAGACCCGATCAATACTAACAAGGTATGGGCCGCCGTCGGAA 

TGTACACCAACAGCTGGGACCCAAACGACGGAGCGATTCTCCGCTCGTCTGA 

TCAGGGCGCAACGTGGCAAATAACGCCCCTGCCGTTCAAGCTTGGCGGCAAC 

ATGCCCGGGCGTGGAATGGGCGAGCGGCTTGCGGTGGATCCAAACAATGACA 

ACATTCTGTATTTCGGCGCCCCGAGCGGCAAAGGGCTCTGGAGAAGCACAGA 

TTCCGGCGCGACCTGGTCCCAGATGACGAACTTTGCGGACGTAGGCACGTAC 

ATTGCAAATCCCACTGACACGACCGGCTATCAGAGCGATATTCAAGGCGTCG 

TCTGGGTCGCTTTCGACAAGTCTTCGTCATCGCTCGGGCAAGCGAGTAAGACC 

ATTTTTGTGGGCGTGGCGGATCCCAATAATCCGGTCTTCTGGAGCAGAGACG 

GCGGCGCGACGTGGCAGGCGGTGCCGGGTGCGCCGACCGGCTTCATCCCGCA 

CAAGGGCGTCTTTGACCCGGTCAACCACGTGCTCTATATTGCCACCAGCAAT 

ACGGGTGGTCCGTATGACGGGAGCTCCGGCGACGTCTGGAAATTCTCGGTGA 

CCTCCGGGACATGGACGCGAATCAGCCCGGTACCTTCGACGGACACGGCCAA 

CGACTACTTTGGTTACAGCGGCCTCACTATCGACCGCCAGCACCCGAACACG 

ATAATGGTGGCAACCCAGATATCGTGGTGGCCGGACACCATAATCTTTCGGA 

GCACCGACGGCGGTGCGACGTGGACGCGGATCTGGGATTGGACGAGTTATCC 

CAATCGAAGCTTGCGATATGTGCTTGACATTTCGGCGGAGCCTTGGCTGACCT 

TCGGCGTACAGCCGAATCCTCCCGTACCGAGTCCGAAGCTCGGCTGGATGGA 

TGAAGCGATGGCAATCGATCCGTTCAACTCTGATCGGATGCTCTACGGAACA 

GGCGCGACGTTGTACGCAACAAATGATGTCACGAAGTGGGACTCCGGCGGCC 

AGATTCATATCGCGCCGATGGTCAAAGGATTGGAGGAGACGGCGGTAAACG 

ATCTCATCAGCCCGCCGTCTGGCGCCCCGCTCATCAGCGCTCTCGGAGACCTC 

GGCGGCTTCACCCACGCCGACGTTACTGCCGTGCCATCGACGATCTTCACGTC 
FIGURE 8B

ACCGGTGTTCACGACCGGCACCAGCGTCGACTATGCGGAATTGAATCCGTCG 

ATCATCGTTCGCGCTGGAAGTTTCGATCCATCGAGCCAACCGAACGACAGGC

ACGTCGCGTTCTCGACAGACGGCGGCAAGAACTGGTTCCAAGGCAGCGAACC 

TGGCGGGGTGACGACGGGCGGCACCGTCGCCGCATCGGCCGACGGCTCTCGT 

TTCGTCTGGGCTCCCGGCGATCCCGGTCAGCCTGTGGTGTACGCAGTCGGATT 

TGGCAACTCCTGGGCTGCTTCGCAAGGTGTTCCCGCCAATGCCCAGATCCGCT 

CAGACCGGGTGAATCCAAAGACTTTCTATGCCCTATCCAATGGAACCTTCTAT 

CGAAGCACGGACGGCGGCGTGACATTCCAACCGGTCGCGGCCGGTCTTCCGA 

GCAGCGGTGCCGTCGGTGTCATGTTCCACGCGGTGCCTGGAAAAGAAGGCGA 

TCTGTGGCTCGCTGCATCGAGCGGGCTTTACCACTCAACCAATGGCGGCAGC 

AGTTGGTCTGCAATCACCGGCGTATCCTCCGCGGTGAACGTGGGATTTGGTA 

AGTCTGCGCCCGGGTCGTCATACCCAGCCGTCTTTGTCGTCGGCACGATCGGA 

GGCGTTACGGGGGCGTACCGCTCCGACGACGGTGGGACGACCTGGGTACGG 

ATCAATGATGACCAGCACCAATACGGAAATTGGGGACAAGCAATCACCGGTG 

ACCCGCGAATTTACGGGCGGGTGTACATAGGCACGAACGGCCGTGGAATTGT 

CTACGGGGACATTGGTGGTGCGCCGTCCGGATCG 
FIGURE 9

Amino acid sequence of Acidothermus cellulolyticus GH74 catalytic domain (741 amino
acids) 

ATTQPYTWSNVAIGGGGFVDGIVFNEGAPGILYVRTDIGGMYRWDAANGRWIPL 

LDWVGWNNWGYNGVVSIAADPINTNKVWAAVGMYTNSWDPNDGAILRSSDQ 

GATWQITPLPFKLGGNMPGRGMGERLAVDPNNDNILYFGAPSGKGLWRSTDSG 

ATWSQMTNFPDVGTYIANPTDTTGYQSDIQGVVWVAFDKSSSSLGQASKTIFVG

VADPNNPVFWSRDGGATWQAVPGAPTGFIPHKGVFDPVNHVLYIATSNTGGPY 

DGSSGDVWKFSVTSGTWTRISPVPSTDTANDYFGYSGLTIDRQHPNTIMVATQIS

WWPDTIIFRSTDGGATWTRIWDWTSYPNRSLRYVLDISAEPWLTFGVQPNPPVPS 

PKLGWMDEAMAIDPFNSDRMLYGTGATLYATNDLTKWDSGGQIHIAPMVKGLE

ETAVNDLISPPSGAPLISALGDLGGFTHADVTAVPSTIFTSPVFTTGTSVDYAELNP 

SIIVRAGSFDPSSQPNDRHVAFSTDGGKNWFQGSEPGGVTTGGTVAASADGSRFV

WAPGDPGQPVVYAVGFGNSWAASQGVPANAQIRSDRVNPKTFYALSNGTFYRS 

TDGGVTFQPVAAGLPSSGAVGVMFHAVPGKEGDLWLAASSGLYHSTNGGSSWS 

AITGVSSAVNVGFGKSAPGSSYPAVFVVGTIGGVTGAYRSDDGGTTWVRINDDQ

HQYGNWGQAITGDPRIYGRVYIGTNGRGIVYGDIGGAPSGS 
Figure 10

DNA sequence of Thermobifida fusca E3 (TfE3) cellulase including the cellulose binding
domain - linker - catalytic domain but lacking a TfE3 signal sequence. (1677 bases)

GCCGGCTGCTCGGTGGACTACACGGTCAACTCCTGGGGTACCGGGTTCACCG 

CCAACGTCACCATCACCAACCTCGGCAGTGCGATCAACGGCTGGACCCTGGA 

GTGGGACTTCCCCGGCAACCAGCAGGTGACCAACCTGTGGAACGGGACCTAC 

ACCCAGTCCGGGCAGCACGTGTCGGTCAGCAACGCCCCGTACAACGCCTCCA 

TCCCGGCCAACGGAACGGTTGAGTTCGGGTTCAACGGCTCCTACTCGGGCAG 

CAACGACATCCCCTCCTCCTTCAAGCTGAACGGGGTTACCTGCGACGGCTCG 

GACGACCCCGACCCCGAGCCCAGCCCCTCCCCCAGCCCTTCCCCCAGCCCCA 

CAGACCCGGATGAGCCGGGCGGCCCGACCAACCCGCCCACCAACCCCGGCG 

AGAAGGTCGACAACCCGTTCGAGGGCGCCAAGCTGTACGTGAACCCGGTCTG 

GTCGGCCAAGGCCGCCGCTGAGCCGGGCGGTTCCGCGGTCGCCAACGAGTCC 

ACCGCTGTCTGGCTGGACCGTATCGGCGCCATCGAGGGCAACGACAGCCCGA 

CCACCGGCTCCATGGGTCTGCGCGACCACCTGGAGGAGGCCGTCCGCCAGTC 

CGGTGGCGACCCGCTGACCATCCAGGTCGTCATCTACAACCTGCCCGGCCGC 

GACTGCGCCGCGCTGGCCTCCAACGGTGAGCTGGGTCCCGATGAACTCGACC

GCTACAAGAGCGAGTACATCGACCCGATCGCCGACATCATGTGGGACTTCGC 

AGACTACGAGAACCTGCGGATCGTCGCCATCATCGAGATCGACTCCCTGCCC 

AACCTCGTCACCAACGTGGGCGGGAACGGCGGCACCGAGCTCTGCGCCTACA 

TGAAGCAGAACGGCGGCTACGTCAACGGTGTCGGCTACGCCCTCCGCAAGCT 

GGGCGAGATCCCGAACGTCTACAACTACATCGACGCCGCCCACCACGGCTGG 

ATCGGCTGGGACTCCAACTTCGGCCCCTCGGTGGACATCTTCTACGAGGCCG 

CCAACGCCTCCGGCTCCACCGTGGACTACGTGCACGGCTTCATCTCCAACAC 

GGCCAACTACTCGGCCACTGTGGAGCCGTACCTGGACGTCAACGGCACCGTT 

AACGGCCAGCTCATCCGCCAGTCCAAGTGGGTTGACTGGAACCAGTACGTCG 

ACGAGCTCTCCTTCGTCCAGGACCTGCGTCAGGCCCTGATCGCCAAGGGCTTC 

CGGTCCGACATCGGTATGCTCATCGACACCTCCCGCAACGGCTGGGGTGGCC 

CGAACCGTCCGACCGGACCGAGCTCCTCCACCGACCTCAACACCTACGTTGA 

CGAGAGCCGTATCGACCGCCGTATCCACCCCGGTAACTGGTGCAACCAGGCC 

GGTGCGGGCCTCGGCGAGCGGCCCACGGTCAACCCGGCTCCCGGTGTTGACG 

CCTACGTCTGGGTGAAGCCCCCGGGTGAGTCCGACGGCGCCAGCGAGGAGAT 

CCCGAACGACGAGGGCAAGGGCTTCGACCGCATGTGCGACCCGACCTACCAG 

GGCAACGCCCGCAACGGCAACAACCCCTCGGGTGCGCTGCCCAACGCCCCCA 

TCTCCGGCCACTGGTTCTCTGCCCAGTTCCGCGAGCTGCTGGCCAACGCCTAC 

CCGCCTCTGTAA 
Figure 11

Amino acid sequence of the Thermobifida fusca E3 - cellulase including the cellulose 
binding domain - linker - catalytic domain but lacking the TfE3 signal sequence. (558
amino acids) 

AGCSVDYTVNSWGTGFTANVTITNLGSAINGWTLEWDFPGNQQVTNLWNGTYT 

QSGQHVSVSNAPYNASIPANGTVEFGFNGSYSGSNDIPSSFKLNGVTCDGSDDPD 

PEPSPSPSPSPSPTDPDEPGGPTNPPTNPGEKVDNPFEGAKLYVNPVWSAKAAAEP

GGSAVANESTAVWLDRIGAIEGNDSPTTGSMGLRDHLEEAVRQSGGDPLTIQVVI

YNLPGRDCAALASNGELGPDELDRYKSEYIDPIADIMWDFADYENLRIVAIIEIDS

LPNLVTNVGGNGGTELCAYMKQNGGYVNGVGYALRKLGEIPNVYNYIDAAHH 

GWIGWDSNFGPSVDIFYEAANASGSTVDYVHGFISNTANYSATVEPYLDVNGTV 

NGQLIRQSKWVDWNQYVDELSFVQDLRQALIAKGFRSDIGMLIDTSRNGWGGP 

NRPTGPSSSTDLNTYVDESRIDRRIHPGNWCNQAGAGLGERPTVNPAPGVDAYV 

WVKPPGESDGASEEIPNDEGKGFDRMCDPTYQGNARNGNNPSGALPNAPISGH 

WFSAQFRELLANAYPPL 
Figure 12

DNA sequence of Thermobifida fusca E5 (TfE5) endoglucanase including the cellulose 
binding domain - linker and catalytic domain but lacking a TfE5 signal sequence. (1293 
bases) 

GCCGGTCTCACCGCCACAGTCACCAAAGAATCCTCGTGGGACAACGGCTACT 

CCGCGTCCGTCACCGTCCGCAACGACACCTCGAGCACCGTCTCCCAGTGGGA 

GGTCGTCCTCACCCTGCCCGGCGGCACTACAGTGGCCCAGGTGTGGAACGCC

CAGCACACCAGCAGCGGCAACTCCCACACCTTCACCGGGGTTTCCTGGAACA 

GCACCATCCCGCCCGGAGGCACCGCCTCTTCCGGCTTCATCGCTTCCGGCAGC 

GGCGAACCCACCCACTGCACCATCAACGGCGCCCCCTGCGACGAAGGCTCCG 

AGCCGGGCGGCCCCGGCGGTCCCGGAACCCCCTCCCCCGACCCCGGCACGCA 

GCCCGGCACCGGCACCCCGGTCGAGCGGTACGGCAAAGTCCAGGTCTGCGGC 

ACCCAGCTCTGCGACGAGCACGGCAACCCGGTCCAACTGCGCGGCATGAGCA 

CCCACGGCATCCAGTGGTTCGACCACTGCCTGACCGACAGCTCGCTGGACGC 

CCTGGCCTACGACTGGAAGGCCGACATCATCCGCCTGTCCATGTACATCCAG 

GAAGACGGCTACGAGACCAACCCGCGCGGCTTCACCGACCGGATGCACCAG 

CTCATCGACATGGCCACGGCGCGCGGCCTGTACGTGATCGTGGACTGGCACA 

TCCTCACCCCGGGCGATCCCCACTACAACCTGGACCGGGCCAAGACCTTCTTC 

GCGGAAATCGCCCAGCGCCACGCCAGCAAGACCAACGTGCTCTACGAGATCG 

CCAACGAACCCAACGGAGTGAGCTGGGCCTCCATCAAGAGCTACGCCGAAG 

AGGTCATCCCGGTGATCCGCCAGCGCGACCCCGACTCGGTGATCATCGTGGG 

CACCCGCGGCTGGTCGTCGCTCGGCGTCTCCGAAGGCTCCGGCCGCGCCGAG 

ATCGCGGCCAACCCGGTCAACGCCTCCAACATCATGTACGCCTTCCACTTCTA 

CGCGGCCTCGCACCGCGACAACTACCTCAACGCGCTGCGTGAGGCCTCCGAG 

CTGTTCCCGGTCTTCGTCACCGAGTTCGGCACCGAGACCTACACCGGTGACG 

GCGCCAACGACTTCCAGATGGCCGACCGCTACATCGACCTGATGGCGGAACG 

GAAGATCGGGTGGACCAAGTGGAACTACTCGGACGACTTCCGTTCCGGCGCG 

GTCTTCCAGCCGGGCACCTGCGCGTCCGGCGGCCCGTGGAGCGGTTCGTCGC 

TGAAGGCGTCCGGACAGTGGGTGCGGAGCAAGCTCCAGTCCTGA 
Figure 13

Amino acid sequence of the Thermobifida fusca E5 - cellulase including the cellulose
binding domain - linker - catalytic domain but lacking a TfE5 signal sequence. (430
amino acids)

AGLTATVTKESSWDNGYSASVTVRNDTSSTVSQWEVVLTLPGGTTVAQVWNAQ

HTSSGNSHTFTGVSWNSTIPPGGTASSGFIASGSGEPTHCTINGAPCDEGSEPGGP 

GGPGTPSPDPGTQPGTGTPVERYGKVQVCGTQLCDEHGNPVQLRGMSTHGIQW 

FDHCLTDSSLDALAYDWKADIIRLSMYIQEDGYETNPRGFTDRMHQLIDMATAR 

GLYVIVDWHILTPGDPHYNLDRAKTFFAEIAQRHASKTNVLYEIANEPNGVSWA

SIKSYAEEVIPVIRQRDPDSVIIVGTRGWSSLGVSEGSGPAEIAANPVNASNIMYAF

HFYAASHRDNYLNALREASELFPVFVTEFGTETYTGDGANDFQMADRYIDLMA 

ERKIGWTKWNYSDDFRSGAVFQPGTCASGGPWSGSSLKASGQWVRSKLQS
Figure 14

DNA sequence of CBH1-E1 fusion (2656 bases)

T. reesei cbh1 signal sequence+catalytic domain+linker+added amino acids
SKR+Acidothermus cellulolyticus GH5A catalytic domain

ATGTATCGGAAGTTGGCCGTCATCTCGGCCTTCTTGGCCACAGCTCGTGCTCA 

GTCGGCCTGCACTCTCCAATCGGAGACTCACCCGCCTCTGACATGGCAGAAA 

TGCTCGTCTGGTGGCACTTGCACTCAACAGACAGGCTCCGTGGTCATCGACG 

CCAACTGGCGCTGGACTCACGCTACGAACAGCAGCACGAACTGCTACGATGG 

CAACACTTGGAGCTCGACCCTATGTCCTGACAACGAGACCTGCGCGAAGAAC 

TGCTGTCTGGACGGTGCCGCCTACGCGTCCACGTACGGAGTTACCACGAGCG 

GTAACAGCCTCTCCATTGGCTTTGTCACCCAGTCTGCGCAGAAGAACGTTGGC 

GCTCGCCTTTACCTTATGGCGAGCGACACGACCTACCAGGAATTCACCCTGCT 

TGGCAACGAGTTCTCTTTCGATGTTGATGTTTCGCAGCTGCCGTAAGTGACTT 

ACCATGAACCCCTGACGTATCTTCTTGTGGGCTCCCAGCTGACTGGCCAATTT 

AAGGTGCGGCTTGAACGGAGCTCTCTACTTCGTGTCCATGGACGCGGATGGT 

GGCGTGAGCAAGTATCCCACCAACACCGCTGGCGCCAAGTACGGCACGGGGT 

ACTGTGACAGCCAGTGTCCCCGCGATCTGAAGTTCATCAATGGCCAGGCCAA 

CGTTGAGGGCTGGGAGCCGTCATCCAACAACGCAAACACGGGCATTGGAGG 

ACACGGAAGCTGCTGCTCTGAGATGGATATCTGGGAGGCCAACTCCATCTCC 

GAGGCTCTTACCCCCCACCCTTGCACGACTGTCGGCCAGGAGATCTGCGAGG 

GTGATGGGTGCGGCGGAACTTACTCCGATAACAGATATGGCGGCACTTGCGA 

TCCCGATGGCTGCGACTGGAACCCATACCGCCTGGGCAACACCAGCTTCTAC 

GGCCCTGGCTCAAGCTTTACCCTCGATACCACCAAGAAATTGACCGTTGTCAC 

CCAGTTCGAGACGTCGGGTGCCATCAACCGATACTATGTCCAGAATGGCGTC 

ACTTTCCAGCAGCCCAACGCCGAGCTTGGTAGTTACTCTGGCAACGAGCTCA 

ACGATGATTACTGCACAGCTGAGGAGGCAGAATTCGGCGGATCCTCTTTCTC 

AGACAAGGGCGGCGTGACTCAGTTCAAGAAGGCTACCTCTGGCGGCATGGTT 

CTGGTCATGAGTCTGTGGGATGATGTGAGTTTGATGGACAAACATGCGCGTT 

GACAAAGAGTCAAGCAGCTGACTGAGATGTTACAGTACTACGCCAACATGCT 

GTGGCTGGACTCCACCTACCCGACAAACGAGACCTCCTCCACACCCGGTGCC 

GTGCGCGGAAGCTGCTCCACCAGCTCCGGTGTCCCTGCTCAGGTCGAATCTC 

AGTCTCCCAACGCCAAGGTCACCTTCTCCAACATCAAGTTCGGACCCATTGGC 

AGCACCGGCAACCCTAGCGGCGGCAACCCTCCCGGCGGAAACCCGCCTGGCA 

CCACCACCACCCGCCGCCCAGCCACTACCACTGGAAGCTCTCCCGGACCTAC 

TAGTAAGCGGGCGGGCGGCGGCTATTGGCACACGAGCGGCCGGGAGATCCT 

GGACGCGAACAACGTGCCGGTACGGATCGCCGGCATCAACTGGTTTGGGTTC 

GAAACCTGCAATTACGTCGTGCACGGTCTCTGGTCACGCGACTACCGCAGCA 

TGCTCGACCAGATAAAGTCGCTCGGCTACAACACAATCCGGCTGCCGTACTC 

TGACGACATTCTCAAGCCGGGCACCATGCCGAACAGCATCAATTTTTACCAG 

ATGAATCAGGACCTGCAGGGTCTGACGTCCTTGCAGGTCATGGACAAAATCG 

TCGCGTACGCCGGTCAGATCGGCCTGCGCATCATTCTTGACCGCCACCGACC 

GGATTGCAGCGGGCAGTCGGCGCTGTGGTACACGAGCAGCGTCTCGGAGGCT 

ACGTGGATTTCCGACCTGCAAGCGCTGGCGCAGCGCTACAAGGGAAACCCGA 
CGGTCGTCGGCTTTGACTTGCACAACGAGCCGCATGACCCGGCCTGCTGGGG

CTGCGGCGATCCGAGCATCGACTGGCGATTGGCCGCCGAGCGGGCCGGAAAC 

GCCGTGCTCTCGGTGAATCCGAACCTGCTCATTTTCGTCGAAGGTGTGCAGAG 

CTACAACGGAGACTCCTACTGGTGGGGCGGCAACCTGCAAGGAGCCGGCCA 

GTACCCGGTCGTGCTGAACGTGCCGAACCGCCTGGTGTACTCGGCGGACGAC 

TACGCGACGAGCGTCTACCCGCAGACGTGGTTCAGCGATCCGACCTTCCCCA 

ACAACATGCCCGGCATCTGGAACAAGAACTGGGGATACCTCTTCAATCAGAA 

CATTGCACCGGTATGGCTGGGCGAATTCGGTACGACACTGCAATCCACGACC 

GACCAGACGTGGCTGAAGACGCTCGTCCAGTACCTACGGCCGACCGCGCAAT 

ACGGTGCGGACAGCTTCCAGTGGACCTTCTGGTCCTGGAACCCCGATTCCGG 

CGACACAGGAGGAATTCTCAAGGATGACTGGCAGACGGTCGACACAGTAAA 

AGACGGCTATCTCGCGCCGATCAAGTCGTCGATTTTCGATCCTGTCGGCTAA 
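
The Figure 14 caption describes the fusion as the cbh1 portion, then the added amino acids SKR, then the GH5A catalytic domain. One way to confirm where the second domain begins is to search the fusion DNA for the opening bases of the Figure 4 listing and check that the match falls on a codon boundary. The Python sketch below does this on short excerpts copied from the listings. `find_in_frame` is a hypothetical helper introduced here for illustration, and the excerpt is assumed to start on a codon boundary.

```python
from typing import Optional, Tuple


def find_in_frame(haystack: str, needle: str,
                  frame_start: int = 0) -> Optional[Tuple[int, bool]]:
    """Locate the first occurrence of needle in haystack.

    Returns (index, in_frame), where in_frame is True when the match starts
    a whole number of codons after frame_start, or None if there is no match.
    """
    hay = "".join(haystack.split()).upper()   # tolerate line-wrapped input
    ndl = "".join(needle.split()).upper()
    idx = hay.find(ndl)
    if idx < 0:
        return None
    return idx, (idx - frame_start) % 3 == 0


# Excerpt of the fusion DNA spanning the junction (copied from Figure 14).
# Assumption: the excerpt begins on a codon boundary.
fusion_excerpt = "CCCGGACCTACTAGTAAGCGGGCGGGCGGCGGCTATTGGCACACGAGCGGCCGG"

# Opening bases of the GH5A catalytic domain DNA (copied from Figure 4).
gh5a_start = "GCGGGCGGCGGCTATTGGCAC"

hit = find_in_frame(fusion_excerpt, gh5a_start)
if hit is not None:
    index, in_frame = hit
    # Everything before the match belongs to the linker and added residues.
    print(index, in_frame, fusion_excerpt[:index])
```

The bases printed ahead of the match are the end of the linker plus the codons for the added residues. The same helper can be run on the complete listings once line breaks and page-break residue have been removed.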
Figure 15

Amino acid sequence of CBH1-E1 fusion (841 amino acids)

T. reesei cbh1 signal sequence+catalytic domain+linker+added amino acids
SKR+Acidothermus cellulolyticus GH5A catalytic domain

MYRKLAVISAFLATARAQSACTLQSETHPPLTWQKCSSGGTCTQQTGSVVIDAN

WRWTHATNSSTNCYDGNTWSSTLCPDNETCAKNCCLDGAAYASTYGVTTSGNS

LSIGFVTQSAQKNVGARLYLMASDTTYQEFTLLGNEFSFDVDVSQLPC

GLNGALYFVSMDADGGVSKYPTNTAGAKYGTGYCDSQCPRDLKFINGQANVEGWEPSSN

NANTGIGGHGSCCSEMDIWEANSISEALTPHPCTTVGQEICEGDGCGGTYSDNRY

GGTCDPDGCDWNPYRLGNTSFYGPGSSFTLDTTKKLTVVTQFETSGAINRYYVQ

NGVTFQQPNAELGSYSGNELNDDYCTAEEAEFGGSSFSDKGGLTQFKKATSGGM

VLVMSLWDDYYANMLWLDSTYPTNETSSTPGAVRGSCSTSSGVPAQVESQSPN

AKVTFSNIKFGPIGSTGNPSGGNPPGGNPPGTTTTRRPATTTGSSPGPTSKRAGGG

YWHTSGREILDANNVPVRIAGINWFGFETCNYVVHGLWSRDYRSMLDQIKSLGY

NTIRLPYSDDILKPGTMPNSINFYQMNQDLQGLTSLQVMDKIVAYAGQIGLRIILD

RHRPDCSGQSALWYTSSVSEATWISDLQALAQRYKGNPTVVGFDLHNEPHDPAC

WGCGDPSIDWRLAAERAGNAVLSVNPNLLIFVEGVQSYNGDSYWWGGNLQGA

GQYPVVLNVPNRLVYSAHDYATSVYPQTWFSDPTFPNNMPGIWNKNWGYLFN

QNIAPVWLGEFGTTLQSTTDQTWLKTLVQYLRPTAQYGADSFQWTFWSWNPDS

GDTGGILKDDWQTVDTVKDGYLAPIKSSIFDPVG
Figure 16
Figure 17

DNA sequence of pTrex4 (10239 bases) 

AAGCTTAACTAGTACTTCTCGAGCTCTGTACATGTCCGGTCGCGACGTACGCG 

TATCGATGGCGCCAGCTGCAGGCGGCCGCCTGCAGCCACTTGCAGTCCCGTG 

GAATTCTCACGGTGAATGTAGGCCTTTTGTAGGGTAGGAATTGTCACTCAAGC 

ACCCCCAACCTCCATTACGCCTCCCCCATAGAGTTCCCAATCAGTGAGTCATG 

GCACTGTTCTCAAATAGATTGGGGAGAAGTTGACTTCCGCCCAGAGCTGAAG 

GTCGCACAACCGCATGATATAGGGTCGGCAACGGCAAAAAAGCACGTGGCT 

CACCGAAAAGCAAGATGTTTGCGATCTAACATCCAGGAACCTGGATACATCC 

ATCATCACGCACGACCACTTTGATCTGCTGGTAAACTCGTATTCGCCCTAAAC 

CGAAGTGACGTGGTAAATCTACACGTGGGCCCCTTTCGGTATACTGCGTGTGT 

CTTCTCTAGGTGCCATTCTTTTCCCTTCCTCTAGTGTTGAATTGTTTGTGTTGG 

AGTCCGAGCTGTAACTACCTCTGAATCTCTGGAGAATGGTGGACTAACGACT 

ACCGTGCACCTGCATCATGTATATAATAGTGATCCTGAGAAGGGGGGTTTGG 

AGCAATGTGGGACTTTGATGGTCATCAAACAAAGAACGAAGACGCCTCTTTT 

GCAAAGTTTTGTTTCGGCTACGGTGAAGAACTGGATACTTGTTGTGTCTTCTG 

TGTATTTTTGTGGCAACAAGAGGCCAGAGACAATCTATTCAAACACCAAGCT 

TGCTCTTTTGAGCTACAAGAACCTGTGGGGTATATATCTAGAGTTGTGAAGTC 

GGTAATCCCGCTGTATAGTAATACGAGTCGCATCTAAATACTCCGAAGCTGCT 

GCGAACCCGGAGAATCGAGATGTGCTGGAAAGCTTCTAGCGAGCGGCTAAAT 

TAGCATGAAAGGCTATGAGAAATTCTGGAGACGGCTTGTTGAATCATGGCGT 

TCCATTCTTCGACAAGCAAAGCGTTCCGTCGCAGTAGCAGGCACTCATTCCCG 

AAAAAACTCGGAGATTCCTAAGTAGCGATGGAACCGGAATAATATAATAGGC 

AATACATTGAGTTGCCTCGACGGTTGCAATGCAGGGGTACTGAGCTTGGACA 

TAACTGTTCCGTACCCCACCTCTTCTCAACCTTTGGCGTTTCCCTGATTCAGCG 

TACCCGTACAAGTCGTAATCACTATTAACCCAGACTGACCGGACGTGTTTTGC 

CCTTCATTTGGAGAAATAATGTCATTGCGATGTGTAATTTGCCTGCTTGACCG 

ACTGGGGCTGTTCGAAGCCCGAATGTAGGATTGTTATCCGAACTCTGCTCGTA 

GAGGCATGTTGTGAATCTGTGTCGGGCAGGACACGCCTCGAAGGTTCACGGC 
AAGGGAAACCACCGATAGCAGTGTCTAGTAGCAACCTGTAAAGCCGCAATGC

AGCATCACTGGAAAATACAAACCAATGGCTAAAAGTACATAAGTTAATGCCT 

AAAGAAGTCATATACCAGCGGCTAATAATTGTACAATCAAGTGGCTAAACGT 

ACCGTAATTTGCCAACGGCTTGTGGGGTTGCAGAAGCAACGGCAAAGCCCCA 

CTTCCCCACGTTTGTTTCTTCACTCAGTCCAATCTCAGCTGGTGATCCCCCAAT 

TGGGTCGCTTGTTTGTTCCGGTGAAGTGAAAGAAGACAGAGGTAAGAATGTC 

TGACTCGGAGCGTTTTGCATACAACCAAGGGCAGTGATGGAAGACAGTGAAA 

TGTTGACATTCAAGGAGTATTTAGCCAGGGATGCTTGAGTGTATCGTGTAAG 

GAGGTTTGTCTGCCGATACGACGAATACTGTATAGTCACTTCTGATGAAGTGG 

TCCATATTGAAATGTAAGTCGGCACTGAACAGGCAAAAGATTGAGTTGAAAC

TGCCTAAGATCTCGGGCCCTCGGGCCTTCGGCCTTTGGGTGTACATGTTTGTG 

CTCCGGGCAAATGCAAAGTGTGGTAGGATCGAACACACTGCTGCCTTTACCA 

AGCAGCTGAGGGTATGTGATAGGCAAATGTTCAGGGGCCACTGCATGGTTTC 

GAATAGAAAGAGAAGCTTAGCCAAGAACAATAGCCGATAAAGATAGCCTCA 

TTAAACGGAATGAGCTAGTAGGCAAAGTCAGCGAATGTGTATATATAAAGGT 

TCGAGGTCCGTGCCTCCCTCATGCTCTCCCCATCTACTCATCAACTCAGATCC 

TCCAGGAGACTTGTACACCATCTTTTGAGGCACAGAAACCCAATAGTCAACC 

GCGGACTGCGCATCATGTATCGGAAGTTGGCCGTCATCTCGGCCTTCTTGGCC 

ACAGCTCGTGCTCAGTCGGCCTGCACTCTCCAATCGGAGACTCACCCGCCTCT 

GACATGGCAGAAATGCTCGTCTGGTGGCACTTGCACTCAACAGACAGGCTCC 

GTGGTCATCGACGCCAACTGGCGCTGGACTCACGCTACGAACAGCAGCACGA 

ACTGCTACGATGGCAACACTTGGAGCTCGACCCTATGTCCTGACAACGAGAC 

CTGCGCGAAGAACTGCTGTCTGGACGGTGCCGCCTACGCGTCCACGTACGGA 

GTTACCACGAGCGGTAACAGCCTCTCCATTGGCTTTGTCACCCAGTCTGCGCA 

GAAGAACGTTGGCGCTCGCCTTTACCTTATGGCGAGCGACACGACCTACCAG 

GAATTCACCCTGCTTGGCAACGAGTTCTCTTTCGATGTTGATGTTTCGCAGCT 

GCCGTAAGTGACTTACCATGAACCCCTGACGTATCTTCTTGTGGGCTCCCAGC 

TGACTGGCCAATTTAAGGTGCGGCTTGAACGGAGCTCTGTACTTCGTGTCCAT 

GGACGCGGATGGTGGCGTGAGCAAGTATCCCACCAACACCGCTGGCGCCAA 

GTACGGCACGGGGTACTGTGACAGCCAGTGTCCCCGCGATCTGAAGTTCATC 

AATGGCCAGGCCAACGTTGAGGGCTGGGAGCCGTCATCCAACAACGCAAAC 
ACGGGCATTGGAGGACACGGAAGCTGCTGCTCTGAGATGGATATCTGGGAGG

CCAACTCCATCTCCGAGGCTCTTACCCCCCACCCTTGCACGACTGTCGGCCAG 

GAGATCTGCGAGGGTGATGGGTGCGGCGGAACTTACTCCGATAACAGATATG 

GCGGCACTTGCGATCCCGATGGCTGCGACTGGAACCCATACCGCCTGGGCAA 

CACCAGCTTCTACGGCCCTGGCTCAAGCTTTACCCTCGATACCACCAAGAAAT 

TGACCGTTGTCACCCAGTTCGAGACGTCGGGTGCCATCAACCGATACTATGTC 

CAGAATGGCGTCACTTTCCAGCAGCCCAACGCCGAGCTTGGTAGTTACTCTG 

GCAACGAGCTCAACGATGATTACTGCACAGCTGAGGAGGCAGAATTCGGCGG 

ATCCTCTTTCTCAGACAAGGGCGGCCTGACTCAGTTCAAGAAGGCTACCTCTG 

GCGGCATGGTTCTGGTCATGAGTCTGTGGGATGATGTGAGTTTGATGGACAA 

ACATGCGCGTTGACAAAGAGTCAAGCAGCTGACTGAGATGTTACAGTACTAC 

GCCAACATGCTGTGGCTGGACTCCACCTACCCGACAAACGAGACCTCCTCCA 

CACCCGGTGCCGTGCGCGGAAGCTGCTCCACCAGCTCCGGTGTCCCTGCTCA 

GGTCGAATCTCAGTCTCCCAACGCCAAGGTCACCTTCTCCAACATCAAGTTCG 

GACCCATTGGCAGCACCGGCAACCCTAGCGGCGGCAACCCTCCCGGCGGAAA 

CCCGCCTGGCACCACCACCACCCGCCGCCCAGCCACTACCACTGGAAGCTCT 

CCCGGACCTACTAGTAAGCGGATAAGGCGCGCCGCGCGCCAGCTCCGTGCGA 

AAGCCTGACGCACCGGTAGATTCTTGGTGAGCCCGTATCATGACGGCGGCGG 

GAGCTACATGGCCCCGGGTGATTTATTTTTTTTGTATCTACTTCTGACCCTTTT 

CAAATATACGGTCAACTCATCTTTCACTGGAGATGCGGCCTGCTTGGTATTGC 

GATGTTGTCAGCTTGGCAAATTGTGGCTTTCGAAAACACAAAACGATTCCTTA 

GTAGCCATGCATTTTAAGATAACGGAATAGAAGAAAGAGGAAATTAAAAAA 

AAAAAAAAAACAAACATCCCGTTCATAACCCGTAGAATCGCCGCTCTTCGTG 

TATCCCAGTACCAGTTTATTTTGAATAGCTCGCCCGCTGGAGAGCATCCTGAA 

TGCAAGTAACAACCGTAGAGGCTGACACGGCAGGTGTTGCTAGGGAGCGTCG 

TGTTCTACAAGGCCAGACGTCTTCGCGGTTGATATATATGTATGTTTGACTGC 

AGGCTGCTCAGCGACGACAGTCAAGTTCGCCCTCGCTGCTTGTGCAATAATC 

GCAGTGGGGAAGCCACACCGTGACTCCCATCTTTCAGTAAAGCTCTGTTGGT 

GTTTATCAGCAATACACGTAATTTAAACTCGTTAGCATGGGGCTGATAGCTTA 

ATTACCGTTTACCAGTGCCGCGGTTCTGCAGCTTTCCTTGGCCCGTAAAATTC 

GGCGAAGCCAGCCAATCACCAGCTAGGCACCAGCTAAACCCTATAATTAGTC 
TCTTATCAACACCATCCGCTCCCCCGGGATCAATGAGGAGAATGAGGGGGAT

GCGGGGCTAAAGAAGCCTACATAACCCTCATGCCAACTCCCAGTTTACACTC 

GTCGAGCCAACATCCTGACTATAAGCTAACACAGAATGCCTCAATCCTGGGA 

AGAACTGGCCGCTGATAAGCGCGCCCGCCTCGCAAAAACCATCCCTGATGAA 

TGGAAAGTCCAGACGCTGCCTGCGGAAGACAGCGTTATTGATTTCCCAAAGA 

AATCGGGGATCCTTTCAGAGGCCGAACTGAAGATCACAGAGGCCTCCGCTGC 

AGATCTTGTGTCCAAGCTGGCGGCCGGAGAGTTGACCTCGGTGGAAGTTACG 

CTAGCATTCTGTAAACGGGCAGCAATCGCCCAGCAGTTAGTAGGGTCCCCTC 

TACCTCTCAGGGAGATGTAACAACGCCACCTTATGGGACTATCAAGCTGACG 

CTGGCTTCTGTGCAGACAAACTGCGCCCACGAGTTCTTCCCTGACGCCGCTCT 

CGCGCAGGCAAGGGAACTCGATGAATACTACGCAAAGCACAAGAGACCCGT 

TGGTCCACTCCATGGCCTCCCCATCTCTCTCAAAGACCAGCTTCGAGTCAAGG 

TACACCGTTGCCCCTAAGTCGTTAGATGTCCCTTTTTGTCAGCTAACATATGC 

CACCAGGGCTACGAAACATCAATGGGCTACATCTCATGGCTAAACAAGTACG 

ACGAAGGGGACTCGGTTCTGACAACCATGCTCCGCAAAGCCGGTGCCGTCTT 

CTACGTCAAGACCTCTGTCCCGCAGACCCTGATGGTCTGCGAGACAGTCAAC 

AACATCATCGGGCGCACCGTCAACCCACGCAACAAGAACTGGTCGTGCGGCG

GCAGTTCTGGTGGTGAGGGTGCGATCGTTGGGATTCGTGGTGGCGTCATCGG 

TGTAGGAACGGATATCGGTGGCTCGATTCGAGTGCCGGCCGCGTTCAACTTC 

CTGTACGGTCTAAGGCCGAGTCATGGGCGGCTGCCGTATGCAAAGATGGCGA 

ACAGCATGGAGGGTCAGGAGACGGTGCACAGCGTTGTCGGGCCGATTACGCA 

CTCTGTTGAGGGTGAGTCCTTCGCCTCTTCCTTCTTTTCCTGCTCTATACCAGG 

CCTCCACTGTCCTCCTTTCTTGCTTTTTATACTATATACGAGACCGGCAGTCAC 

TGATGAAGTATGTTAGACCTCCGCCTCTTCACCAAATCCGTCCTCGGTCAGGA 

GCCATGGAAATACGACTCCAAGGTCATCCCCATGCCCTGGCGCCAGTCCGAG 

TCGGACATTATTGCCTCCAAGATCAAGAACGGCGGGCTCAATATCGGCTACT 

ACAACTTCGACGGCAATGTCCTTCCACACCCTCCTATCCTGCGCGGCGTGGAA 

ACCACCGTCGCCGCACTCGCCAAAGCCGGTCACACCGTGACCCCGTGGACGC 

CATACAAGCACGATTTCGGCCACGATCTCATCTCCCATATCTACGCGGCTGAC 

GGCAGCGCCGACGTAATGCGCGATATCAGTGCATCCGGCGAGCCGGCGATTC 

CAAATATCAAAGACCTACTGAACCCGAACATCAAAGCTGTTAACATGAACGA 
GCTCTGGGACACGCATCTCCAGAAGTGGAATTACCAGATGGAGTACCTTGAG

AAATGGCGGGAGGCTGAAGAAAAGGCCGGGAAGGAACTGGACGCCATCATC 

GCGCCGATTACGCCTACCGCTGCGGTACGGCATGACCAGTTCCGGTACTATG 

GGTATGCCTCTGTGATCAACCTGCTGGATTTCACGAGCGTGGTTGTTCCGGTT 

ACCTTTGCGGATAAGAACATCGATAAGAAGAATGAGAGTTTCAAGGCGGTTA 

GTGAGCTTGATGCCCTCGTGCAGGAAGAGTATGATCCGGAGGCGTACCATGG 

GGCACCGGTTGCAGTGCAGGTTATCGGACGGAGACTCAGTGAAGAGAGGAC 

GTTGGCGATTGCAGAGGAAGTGGGGAAGTTGCTGGGAAATGTGGTGACTCCA 

TAGCTAATAAGTGTCAGATAGCAATTTGCACAAGAAATCAATACCAGCAACT 

GTAAATAAGCGCTGAAGTGACCATGCCATGCTACGAAAGAGCAGAAAAAAA 

CCTGCCGTAGAACCGAAGAGATATGACACGCTTCCATCTCTCAAAGGAAGAA 

TCCCTTCAGGGTTGCGTTTCCAGTCTAGACACGTATAACGGCACAAGTGTCTC 

TCACCAAATGGGTTATATCTCAAATGTGATCTAAGGATGGAAAGCCCAGAAT 

CTAGGCCTATTAATATTCCGGAGTATACGTAGCCGGCTAACGTTAACAACCG 

GTACCTCTAGAACTATAGCTAGCATGCGCAAATTTAAAGCGCTGATATCGAT 

CGCGCGCAGATCCATATATAGGGCCCGGGTTATAATTACCTCAGGTCGACGT 

CCCATGGCCATTCGAATTCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAAT 

TGTTATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTA 

AAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTC 

ACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAATCG 

GCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTC 

GCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTC 

ACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAA 

AGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCG 

CGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAAAT 

CGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAG 

GCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCT 

TACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATA 

GCTCACGCTGTAGGTATCTGAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGC 

TGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTA

TCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCC 
ACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCT
TGAAGTGGTGGCCTAACTACGGCTACACTAGAAGAACAGTATTTGGTATCTG 
CGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCG 
GCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATT 
ACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGT 
CTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATT 
ATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAAT 
CAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATC 
AGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTG 
ACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCC 
AGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAG 
CAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTT 
ATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTT 
CGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTG 
TCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAG 
GCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTC 
CTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATG 
GCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGT 
GACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCG 
AGTTGCTCTTGCCCGGCGTCAATACGGGATAATACGGCGCCACATAGCAGAA 
CTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAG 
GATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACT 
GATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGA 
AGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATA 
CTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTC
ATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTC 
CGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTAT 
CATGACATTAACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTCTCGCGC 
GTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGT 
CACAGCTTGTCTGTAAGCGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCG 
TCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAACTATGCGGCATCAGAGC 
AGATTGTACTGAGAGTGCACCATAAAATTGTAAACGTTAATATTTTGTTAAAA

TTCGCGTTAAATTTTTGTTAAATCAGCTCATTTTTTAACCAATAGGCC

CGGCAAAATCCCTTATAAATCAAAAGAATAGCCCGAGATAGGGTTGAGTGTT

GTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCCAACGTCA

AAGGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACC

CAAATCAAGTTTTTTGGGGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCT

AAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCG

AGAAAGGAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAG

TGTAGCGGTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAATGCGCCG

CTACAGGGCGCGTACTATGGTTGCTTTGACGTATGCGGTGTGAAATACCGCA

CAGATGCGTAAGGAGAAAATACCGCATCAGGCGCCATTCGCCATTCAGGCTG

CGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTATTACGCCAGC

TGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTT

TTCCCAGTCACGACGTTGTAAAACGACGGCCAGTGCC